Appearance of Random Matrix Theory in deep learning
نویسندگان
چکیده
We investigate the local spectral statistics of loss surface Hessians artificial neural networks, where we discover excellent agreement with Gaussian Orthogonal Ensemble across several network architectures and datasets. These results shed new light on applicability Random Matrix Theory to modelling networks suggest a previously unrecognised role for it in study surfaces deep learning. Inspired by these observations, propose novel model true consistent our which allows Hessian densities rank degeneracy outliers, extensively observed practice, predicts growing independence gradients as function distance weight-space. further importance find, contrast previous work, that exponential hardness locating global minimum has practical consequences achieving state art performance.
منابع مشابه
Nonlinear random matrix theory for deep learning Nonlinear random matrix theory for deep learning
Neural network configurations with random weights play an important role in the analysis of deep learning. They define the initial loss landscape and are closely related to kernel and random feature methods. Despite the fact that these networks are built out of random matrices, the vast and powerful machinery of random matrix theory has so far found limited success in studying them. A main obst...
متن کاملNonlinear random matrix theory for deep learning
Neural network configurations with random weights play an important role in the analysis of deep learning. They define the initial loss landscape and are closely related to kernel and random feature methods. Despite the fact that these networks are built out of random matrices, the vast and powerful machinery of random matrix theory has so far found limited success in studying them. A main obst...
متن کاملUniversality in Random Matrix Theory
which is the Central Limit Theorem. In principle, all the random variables X1, X2, · · · , XN can be of order 1, hence SN ∼ 1 as well, but the probability of having such a rare event is incredibly small. We can even estimate the bound on the probability for the rare event from the large deviation principle. A similar phenomenon happens when we form a large matrix from i.i.d. random variables an...
متن کاملDevelopments in Random Matrix Theory
In this preface to the Journal of Physics A, Special Edition on Random Matrix Theory, we give a review of the main historical developments of random matrix theory. A short summary of the papers that appear in this special edition is also given.
متن کاملRandom Matrix Theory
Random matrix theory is usually taught as a sequence of several graduate courses; we have 16 lectures, so we will give a very brief introduction. Some relevant books for the course: • G. Anderson, A. Guionnet, O. Zeitouni. An introduction to random matrices. [1] • A. Guionnet. Large random matrices: lectures on macroscopic asymptotics. • M. L. Mehta. Random matrices. The study of random matrice...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Physica D: Nonlinear Phenomena
سال: 2022
ISSN: ['1872-8022', '0167-2789']
DOI: https://doi.org/10.1016/j.physa.2021.126742